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The photoelectron properties of DNA and RNA bases are studied using many-body perturba- 
tion theory within the GW approximation, together with a recently developed Lanczos-chain ap- 
proach. Calculated vertical ionization potentials, electron affinities, and total density of states are in 
good agreement with experimental values and photoemission spectra. The convergence benchmark 
demonstrates the importance of using an optimal polarizability basis in the GW calculations. A 
detailed analysis of the role of exchange and correlation in both many-body and density-functional 
theory calculations shows that while self-energy corrections are strongly orbital-dependent, they 
nevertheless remain almost constant for states that share the same bonding character. Finally, we 
report on the inverse lifetimes of DNA and RNA bases, that are found to depend linearly on quasi- 
particle energies for all deep valence states. In general, our GoWo-Lanczos approach provides an 
efficient yet accurate and fully converged description of quasiparticle properties of five DNA and 
RNA bases. 
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I. INTRODUCTION 

Understanding the photoelectron properties of DNA 
and RNA bases and strands is of central importance to 
the study of DNA damage following exposure to ultravi- 
olet light or ionizing radiation^, and to the development 
of fast DNA sequencing techniques^ and DNA and RNA- 
based molecular electronics and sensors^ ^. Extensive ex- 
perimental efforts^ ^^ have been made since the 1970's 
to measure the photoelectron properties of DNA and 
RNA bases. Meanwhile, theoretical calculations on their 
ionization potentials and electron affinities have been 
carried out using density- functional th eory ( DFT) and 
high-level quantum chemistry methoda^EoEMill^ How- 
ever, the results from DFT calculations are highly depen- 
dent on exchange-correlation functionals, and quantum 
chemistry methods, though more accurate, require con- 
siderably more computational effort. In contrast, many- 
body perturbation theory within Hedin's GW approxi- 
matiorP^Ell presents a unique framework that allows ac- 
cess to both quasi-particle (QP) energies and lifetimes 
on the same footing. This method has been successfully 
applied to quasi-one-dimensional(lD), two-dimensional 
(2D) and three-dimensional (3D) semiconductors, insu- 
lators, and metals^^H^, and very recently to molecular 
systemPHSH. 

In this work, we present the entire QP spectrum of 
DNA and RNA bases using many-body perturbation the- 
ory within Hedin's GW approximation, obtained with 
a recently developed approach that is parti cularly effec- 
tive in reaching numerical convergenc d^^ ^ ^^ i In the GW 
approximation, the self-energy operator is expressed as 
a convolution of the QP Green's function G with the 
screened Coulomb interaction W. Therefore, at increas- 



ing system sizes (as is the case for the present work) two 
computational challenges arise: (i) first, a large basis set 
has to be adopted to represent operators such as polariz- 
ability, and (ii) the calculation of the irreducible dynami- 
cal polarizability and that of the self-energy require sums 
over single-particle conduction states that converge very 
slowly. We overcome these two obstacles through (i) the 
use of optimal basis sets for representing the polariza- 
tion operators^ and (ii) the use of a Lanczos-chain algo- 
rithm^^ to avoid explicit sums over empty single-particle 
states. In addition, the GqWq approximation is adopted, 
in which the dynamical polarizability is calculated within 
the random-phase approximation and the QP Green's 
function is replaced by its unperturbed single-particle 
counterpart. This approach is implemented in the open- 
source QuANTUM-ESPRESSO distribution^^. It is ap- 
plied here to achieve fully converged QP spectra and in- 
verse lifetimes of the five isolated DNA and RNA bases 
and to investigate the important but distinct roles of ex- 
change and correlation in the GqWq self-energy correc- 
tions. 

Photoelectron properties of DNA and RNA bases us- 
ing many-body GW have not been reported until a very 
recent study by Faber et al.^^. The work by Faber et 
al. presents a many-body GW study on QP energies 
(including ionization potentials and electron affinities) of 
DNA and RNA bases at several levels of self-consistency 
within the GW approximation. Their calculations were 
based on the conventional implementation of the GW 
method using a localized basis set and a direct sum-over- 
states approach, and it demonstrated that self-consistent 
GW calculations indeed further improve the results of 
GqWq (one-shot GW) calculations. Although the local- 
ized basis set can significantly improve the computational 
efficiency and a direct sum-over-states approach can be 



easily implemented, both of them could have several po- 
tential drawbacks, and can introduce large errors in QP 
energies. One may solve the former issue by systemat- 
ically increasing the size of basis sets; however, there is 
no simple solution to the convergence problem introduced 
by the direct sum-over-state approach. Second, dipole- 
bound conduction states will not be obtained from lo- 
calized basis sets due to their highly diffuse character in 
the vacuum region. In fact, only electron affinities of 
covalent-bound conduction states were reported in the 
Faber work. Therefore, it would be desirable to calculate 
GW QP energies in a plane- wave basis without suffering 
from the above issues, which is one of the subjects of this 
research. 

The paper is organized as follows. Computational de- 
tails of our calculations are given in Sec. |ll| Real- space 
representations of optimal polarizability basis are dis- 
played in Se c. |III[ We then report convergence bench- 
mark in Sec. |IV[ In Sec. [V| we present QP energies and 
inverse lifetimes as well as the entire QP spectra for all 
five DNA and RNA bases, including guanine (G), ade- 
nine (A), cytosine (C), thymine (T), and uracil (U). Ver- 
tical ionization potentials (VIPs) and vertical electron 
affinities (VEAs) are compared to experimental data and 
other theoretical results. Two types of VEAs are re- 
ported using plane- wave basis, including valence-bound 
(VB, also called covalent-bound) VEA and dipole-bound 
(DB) VEAs. In Sec. VL we reveal the role of exchange 
and correlation in GW self-energy corrections to the DFT 
Kohn-Sham eigenvalues. Finally, we summarize our work 
in Sec. [Vni 



II. COMPUTATIONAL DETAILS 
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FIG. 1: (color online). Ground-state structures of five DNA 
and RNA bases including G9K-guanine, adenine, Cl-cytosine, 
thymine, and uracil. 

Ground-state DFT calculations are performed in 
a cubic supercell of 18.0^ A^, using the Perdew- 
Burke-Ernzerhof's (PBE) exchange-correlation func- 
tional, Troullier-Martins's norm-conserving pseudopo- 
tentials, and a plane- wave basis set with a cutoff of 544 
eV. Structures are optimized with a residual force thresh- 
old of 0.026 eV/A. A truncated Coulomb potential with 
radius cutoff of 7.4 A is employed to remove artificial 
interactions from periodic images. The vacuum level is 
corrected by an exponential fitting of £^homo with re- 
spect to the supercell volume. The polarizabihty basis 
sets have been obtained using a parameter £^* of 136.1 
eV and a threshold g* of 0.1 a.u., giving an accuracy 



of 0.05 eV for the calculated QP energies (^* and g* 
will be explained in the next section). The final accu- 
racy including the errors from the analytic continuation 
is about 0.05 to 0.1 eV. The structures of five DNA and 
RNA bases are shown in Fig. [l] Here the effect of gas- 
phase tautomeric forma^^ of guanine and cytosine on QP 
properties are beyond the scope of this work, and we only 
focus on the G9K form of guanine and the CI form of 
cytosine^. 



III. OPTIMAL POLARIZABILITY BASIS 

The key quantity in many-body GW calculations is 
the irreducible dynamic polarizability Pq in the random- 
phase approximation: 
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where 77 is an infinitesimal positive real number, \^l)v^|)c) 
denotes the direct product of a valence state ifj^ and a 
conduction state ijjc in real space and ifj^ and ?/^c are con- 
sidered to be real. A strategy was proposed in Refs. [27l 
and [28] for obtaining a compact basis set, referred to as 
optimal polarizability basis^ to represent Pq at all fre- 
quencies. First, we consider the frequency average of 
Po{uj) which corresponds to the element at time t = 

of its Fourier transform Po(^), without considering the 
constant {—i): 



Po{t = 0)=Y,\^v^c){^c^v\ 
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We note that Po{t = 0) is positive definite. Then, the 
optimal polarizability basis^ {^m}^ is built from the most 

important eigenvectors of Po(^ = 0), corresponding to 
the largest eigenvalues q^ above a given threshold q*\ 



Po(t = 0)|$^) = 9p|*^). 



(3) 



It must be noted that this does not require any explicit 
calculation of empty (i.e., conduction) states as we can 
use the closure relation: 

Pc = l-A, (4) 

together with an iterative diagonalization scheme. How- 
ever, the latter procedure would build polarizability ba- 
sis sets which are larger than what is necessary for a 
good convergence of the quasi-particle energy levels. This 
stems from treating all the one-particle excitations on 
the same footing, independent of their energy. A practi- 
cal solution would be to limit the sum in Eq. Q on the 
conduction states below a given energy cutoff E* : 
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However, limiting the sum over the empty states laying 
in the lower part of the conduction manifold does not 
allow to use the closure relation alluded to above. 

Thus, to keep avoiding the calculation of empty states 
we replace them in Eq. ([5| with a set of plane waves 
{G} with their kinetic energies lower than E* ^ which are 
first projected onto the conduction manifold using Eq. Q 
and then orthonormalized. We indicate these augmented 
plane-waves as {G} and arrive at the following modified 
operator: 



P^' = J2\^vG){G^,\ 



(6) 
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which is also positive-definite. An optimal polarizability 

basis {<l>^} is finally obtained by replacing Po{t = 0) in 

Eq. ^ with P^^ 

It should be stressed that the above approximation is 
used only for obtaining a set of optimal basis vectors for 
representing the polarization operators and not for the 
actual calculation of the irreducible dynamic polarizabil- 
ity at finite frequency in Eq. Q; the latter is performed 
using a Lanczos-chain algorithm^. Moreover, due to the 

completeness of the eigenvectors of P" ^ for any value of 
E'* the GW results will converge to the same values by 
lowering the threshold q* , and eventually reach the same 
results as those obtained by directly using a dense basis 
of plane- waves. However, compared to the pure plane- 
waves which are completely delocalized in real space, the 
optimal polarizability basis is particularly convenient for 
isolated systems since the most important eigenvectors 

of pQ will be mostly localized in the regions with higher 
electron density. Thus, converged results can be obtained 
using much smaller optimal-polarizability basis sets than 
plane- waves basis sets. 



these provide well converged results. It is clearly seen 
that the eigenvalues of the optimal polarizability basis 
decay exponentially and change by almost four orders of 
magnitude from the first to the last basis. In Fig. |3] we 
show the real-space representations of a few selected el- 
ements. The first five, corresponding to the five largest 
eigenvalues, are strongly localized around the chemical 
bonds of the molecule. The second row contains five ele- 
ments which are more delocalized, and those in the last 
row are completely delocalized. This indicates that even 
though localized optimal bases like those shown in the 
first two rows can be easily captured by localized basis- 
sets, the delocalized ones with smaller eigenvalues q^ (like 
those in the last row) are more difficult to capture if dif- 
fuse functions are not employed. 
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FIG. 3: (color online). Real-space representation of optimal- 
polarizability basis elements for cytosine, labeled with their 
eigenvalue indexes. Due to the delocalized nature of the op- 
timal basis in the third row, the images in the third row were 
generated with a smaller isovalue and shown at a larger scale 
than those in the first two rows. 



3000 



cu 2000 



.JOOO 




400 800 1200 1600 2000 
/i: index of optimal polarizability basis 



FIG. 2: (Color online) Eigenvalue distribution of the optimal 
polarizability basis for cytosine. The inset plot shows the 
eigenvalues in a log scale. 

Now, we want to have a closer look at the optimal po- 
larizability basis. The eigenvalue distribution of Pq for 
cytosine is displayed in Fig. |2J We only show the largest 
1600 eigenvalues with E^ = 136.1 eV in the plot, since 



IV. CONVERGENCE BENCHMARK 

The number of optimal-polarizability basis elements 
Np and the energy cutoff of the augmented plane- waves 
^* are two critical parameters used in our GqWo calcu- 
lations to achieve both efficiency and accuracy. There- 
fore, we performed a series of calculations to bench- 
mark the convergence with respect to these two param- 
eters. In Fig. [4J we present the convergence behavior 
of VIPs and VB-VEAs of five DNA and RNA bases for 
the highest-occupied molecular orbital (HOMO) and the 
lowest-unoccupied molecular orbital (LUMO), respec- 
tively: VIP = -Re(£g^Mo) and VEA = -Re(£^^Mo)- 
We find that for both VIPs and VEAs convergence within 
0.1 eV is achieved with ~ 600 optimal basis elements for 
£^* = 95.2 eV and with ~ 750 optimal basis elements 
for ^* = 136.1 eV. Indeed, similar trends were reported 
in Ref. [27| VIPs and VEAs reported in the following 
sections are calculated using the most strict parameters 
{Np = 2400 and £;* = 136.1 eV). 
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FIG. 4: (color online). Convergence benchmark of VIP and 
VEA of five DNA and RNA bases with respect to the number 
of optimal-polarizability basis elements Np, and augmented 
plane-wave cutoff E* . Results using ^* = 95.2 and 136.1 eV 
are plotted in dashed-blue lines and solid-red lines, respec- 
tively. 



The above benchmark indicates that, if basis- sets and 
conduction states in DFT calculations are not properly 
tested, one could easily obtain non-converged results 
from GqWq calculations, resulting in higher VIPs and 
lower VEAs for all five bases. We also note that the 
choice of Np and E* remains the same for all the DNA 
and RNA bases, indicating portability for these parame- 
ters. 
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FIG. 5: (color onUne). VIP and VB-VEA of five DNA and 
RNA bases from our DFT and GqWq calculations. Here we 
adopt the mean values of various experimental data listed in 
Table |l] The experimental ranges are 8.0 ^ 8.3, 8.3 ~ 8.5, 
8.8 - 8.9, 9.0 - 9.2, and 9.4 - 9.6 eV for G, A, C, T, and U, 
respectively. 



VIPs and VB-VEAs from our GqWq calculations and 
experimental data are shown in Fig. |5] for all five bases, 
together with the DFT-PBE eigenvalues for the HOMO 
and LUMO levels. Only the mean values of experimental 
VIPs and VEAs are plotted in Fig. [5] GqWo dramatically 
improves VIPs and VEAs compared to DFT-PBE eigen- 
values, providing VIPs of 7.64, 7.99, 8.18, 8.63 and 8.99 
eV and VEAs of -0.43, -0.25, -0.02, 0.24, and 0.23 
eV for G, A, C, T, and U, respectively. The experimen- 
tal VIPs are compiled in Table |lj and span a range of 
8.0 - 8.3, 8.3 - 8.5, 8.8 - 8.9, 9.0 - 9.2, and 9.4 - 9.6 
eV for G, A, C, T, and U. Compared to the mean values 
of experimental VIPs, the mean absolute error of the cal- 
culated VIPs for all five bases is 0.52 eV. Furthermore, 
experimental VB-VEAs are negative for all five bases, in- 
dicating that excited tt* states are unstable upon electron 
attachment. This leads to challenging measurements of 
VEAs and a wide range of measured valuea^^E^ listed in 
Table [I| -0.56 - -0.45, -0.55 - -0.32, -0.53 - -0.29, 

and -0.30 r 0.22, for A, C, T, and U. Compared to 

the mean values of experimental VEAs, the mean ab- 
solute errors of the calculated VEAs for four bases is 
0.45 eV. Interestingly, the VEA of guanine has never 
been measured successfully, possibly due to a large neg- 
ative value. This is clearly reflected in our calculated 
GqWq VEA of —0.43 eV, which is the most negative one 
among all five bases. Even though the GqWo VEAs of 
thymine and uracil are slightly positive, the trend for all 
the calculated VEAs agrees well with experiments. In 
addition, the DFT-PBE HOMO-LUMO gaps for the five 
bases are about 45% of the GqWq gaps. This is in agree- 
ment with previous observations that DFT with the local 
density approximation (LDA) or the generalized gradient 



approximation (GGA) of exchange-correlation function- 
als usual ly underestimates by 30-50% the true QP energy 
gaiPEi. 

We further compare several low- lying GqWq VIPs and 
their excitation characters with experimental and other 
theoretical results and assignments. First, as shown in 
Table [T| both GqWq VIPs and their orbital assignments 
agree well with experiments and other theoretical works 
for all the five bases, where the corresponding excita- 
tion character is either tt or n (lone pair). Second, our 
GqWq VIPs, especially those corresponding to the five 
HOMO levels, are in good agreement with Faber's GqWq 
values calculated in localized basis sets. However, larger 
deviations are clearly observed in some of the lone pair 
valence states. Their GqWo VIPs are higher than our 
values by 0.30, 0.40, 0.38, and 0.37 eV for HOMO-1 
(the first lone pair state) of cytosine, HOMO-1 (the first 
lone pair state) of thymine, and HOMO-1 and HOMO-3 
(the first and second lone pair states) of uracil, respec- 
tively. We plot in Fig. [6] the convergence behavior of 
VIPs with respect to the dimension of the polarizability 
basis for these lone pair states to check whether conver- 
gence issues are present. But it is apparent that VIPs 
from our GqWq calculations are fully converged. An- 
other significant difference is found in the valence-bound 
VEAs for all five LUMO levels. Moreover, Faber's GqWo 
VB-VEAs are lower than the present results by 0.61, 
0.39, 0.43, 0.38, and 0.34 eV for G, A, C, T, and U, re- 
spectively. It is interesting to notice that similar trends 
of increased VIPs and decreased VEAs are observed in 
the previous convergence benchmark of Fig. [4J when a 
small optimal polarizability basis was employed. How- 
ever, since we do not find significant difference in the 
GqWo VIPs for other QP states, the source of the above 
deviations is not clear. Furthermore, as listed in Table [l| 
the work by Faber et al. demonstrated the importance of 
self-consistency of QP energies in GW calculations with 
QP wavefunctions unchanged. This self-consistent GW 
method increases the GqWq VIPs of the HOMO levels 
by 0.32, 0.32, 0.52, 0.41, and 0.44 eV and decreases the 
GqWq VEAs of the LUMO levels by 0.54, 0.50, 0.46, 
0.53, and 0.53 eV for G, A, C, T, and U, respectively. 
Results from advanced quantum chemistry methods are 
also listed in Table [ij including complete active space 
with second-order perturbation theory (CASPT2/^2E1I, 
coupled-cluster with single s, dou bles, and perturbative 
triple excitations [CCSD(T)p^Ell, and equation of motion 
ionization potential coupled-cluster (EOM-IP-CCSD)^^. 
VIPs from CASPT2, CCSD(T), and EOM-IP-CCSD for 
the HOMO levels are very similar, and close to the ex- 
perimental mean values within 0.07, 0.07, and 0.05 eV, 
respectively. VEAs from CASPT2 and CCSD(T) for the 
LUMO levels are also close to each other; however, they 
are less close to the mean experimental values (within 
0.30 and 0.33 eV, respectively). Among all the theoretical 
approaches, self-consistent GW and quantum chemistry 
methods provide the VIPs and VEAs with the smaller 
errors with respect to the experimental data. 
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FIG. 6: (color online). Convergence behavior of vertical ion- 
ization potentials of several n states in DNA and RNA bases 
with respect to the number of optimal polarizability basis, 
Np^ and augmented plane- wave cutoff, ^*. These states 
are cytosine's HOMO-1 state, thymine's HOMO-1 state, and 
uracil's HOMO-1 and HOMO-3 states. Results using ^* = 
95.2 and 136.1 eV are plotted in dashed-blue lines and solid- 
red lines, respectively. 



Beside the VB-VEAs, there also exist dipole-bound 
(DB) VEAs, which correspond to having the additional 
electron weakly bound to the DNA and RNA bases by 
local electrostatic dipolea^^. Both types of QP states are 
shown in Fig. [7j It is clear that all five VB states are 
localized tt* states, while DB states present large lobes, 
highly extended outside the molecules. These lobes are 
mainly located in the vicinity of the N-H bond, and with 
a non-negligible dipole moment along their bond axis. 
The energy difference between the VB-VEAs and their 
nearest DB-VEAs, Avea = VEA(VB) - VEA(DB), are 
-0.23, 0.06, 0.21, 0.48, and 0.52 eV for G, A, C, T, and 
U, respectively. This suggests that at the GqWq level 
VB states in the latter four bases are energetically more 



TABLE I: Vertical ionization potentials and vertical electron affinities for several low-lying Kohn-Sham eigenstates close to 
the HOMO and LUMO levels obtained from negative DFT-PBE eigenvalues and G'oWo(PBE) in comparison with other GW 
and quantum chemistry calculations and experimental data. CASPT2: complete active space with second-order perturbation 
theory; CCSD(T): coupled-cluster with singles, doubles, and perturbative triple excitations; EOM: equation of motion ionization 
potential coupled-cluster. Only the valence-bound vertical electron affinities are shown in this table. The experimental mean 
values are taken as the reference in the calculations of mean absolute error(MAE) for both LUMO and HOMO levels. 
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FIG. 7: (color online). Valence-bound and dipole-bound 
VEAs and their corresponding QP states, calculated at the 
DFT-PBE level, in the five DNA and RNA bases. Values 
listed below are VEAs in the unit of eV. 



stable than the DB ones. 

Experimental valence photoemission spectra extends 
into deep valence state s^ 37 38^ allowing us to further eval- 
uate our G0W3 results at a broader energy range. The 
DFT-PBE and GqWq densities of states (DOS) for all 
five bases, neglecting any oscillator strength effect, are 
compared to valence photoemission spectra in Fig. [sja). 
For better comparison, both curves are shifted to match 
the first experimental VIP. It is clearly shown that for 
all five bases the Go Wo DOS agrees much better with 
the experiment than the DFT-PBE DOS, thanks to the 
correct relative position of the various peaks. Moreover, 
the Go Wo self-energy not only leads to large corrections 
to DFT eigenvalues, but also provides an estimation of 
QP intrinsic lifetimes due to inelastic electron-electron 
scattering, as reflected in the imaginary part of QP en- 
ergies, with I/tji = 2|Im(£^^)|. The calculated QP in- 
verse lifetimes at the Go Wo level are plotted in Fig.jsj^b) 
against the corresponding QP valence energies. Although 
Go Wo permits only a rough estimate of QP lifetimes 
(the exact ones are expected to be zero in the range 
[2Re(£:fjQj^Q), ^^(efjQj^Q)]), we note that the QP in- 
verse lifetimes decrease almost linearly with respect to 
QP energies for the deep valence states in all five cases. 
However, it is still unknown to what extent the Go Wo 
estimation of inverse lifetime would be modified by fully 
self-consistent GW calculations. 



VI. ROLE OF EXCHANGE AND 

CORRELATION IN GROUND-STATE DFT AND 

GW CALCULATIONS 

To understand the role of exchange and correlation in 
the self-energy corrections to the DFT-PBE results, we 
first express each Kohn-Sham eigenvalue e^^ of eigen- 
state il)n for the n-th state as the sum of a single- 
particle energy e^ and an exchange-correlation energy 
^n^- ^n^ ~ ^n + ^n^' whcrc s^ coutaius the energy con- 
tributions from the kinetic energy operator, the exter- 
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FIG. 8: (color online), (a) Experimental valence photoe- 
mission spectrum (shaded gray area), DFT-PBE DOS (blue 
dashed lines), and GqWq DOS (red soUd lines). Both DOS 
curves are shifted to match the first VIP of experimental data. 
Experimental PES spectra of G, A, C, T, and U are extracted 
from Refs. [37l |8| 13 p^, and "38 , respectively. The theoretical 
DOS have been obtained through a Lorentzian broadening de- 
fined by a width of 0.4 eV. (b) GqWq QP energies and inverse 
lifetime for valence states (unit: eV). 



nal ionic potential, and the Hartree term. Furthermore, 
the GqWq QP energy can be written in terms of the ex- 
change self-energy T^ and of the correlation self-energy 
^n- ^n°^° = ^n^^^ ^^n' Exchange and correlation 
effects can then be systematically investigated by analyz 

and plot the above quantities with respect to the GqWq 
QP energy £^°^°. As shown in Figs. 9[a) and (b), the 
GqWq exchange energy S^ increases from —28.0 to —17.8 
eV for the 25 valence states and from —9.2 to —0.2 eV 
for the 10 conduction states, while the GqWq correlation 
energy S^ decreases from 7.1 down to 0.2 eV for the va- 
lence states and from —0.3 to —3.1 eV for the conduction 
states. This clearly shows that H^ is always negative, 
stabilizing both electron and hole excitations; however, 
S^ is positive for valence states and negative for con- 
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FIG. 9: (color online). The role of exchange and correlation in 
the GqWo self-energy corrections to Kohn-Sham eigenvalues 
of 25 valence states and 10 conduction states in adenine, (a) 
GqWo exchange energy Sj^, (b) GqWq correlation energy S^, 
(c) the sum of GqWo exchange and correlation energy SjJ 
(filled symbols) and DFT XC energy £-^^ (unfilled symbols), 
and (d) the difference between GqWq and DFT exchange- 
correlation energy, AJ^ = Tin —^n • Four types of molecular 
orbitals are illustrated in (a)-(d), corresponding to ass, (Jsp, 
n, and tt characters. 



duction states, indicating that the effect of correlation is 
that of destabilizing hole excitations and of stabilizing 
electron excitations. Although H^ and H^ have opposite 
trends for hole excitations, exchange interactions eventu- 
ally dominate due to their larger magnitude, leading to 
the negative S^^ of Fig. [9[c). Interestingly, the GqWo 
S^^ is lower than the DFT-PBE e^^ for the valence 
states, but higher than e^^ for the conduction states. 
Consequently, the difference A^^ between S^^ and e^^ , 
shown in Fig. ^d), is negative for the valence manifold 
and positive for the conduction manifold, resulting in an 
increased HOMO-LUMO gap. The same behavior is ob- 
served for the other four bases as well. 

As shown in Fig. [9J we can recognize five major or- 
bital types among the valence and conduction orbitals 
of the isolated adenine molecule: (755, agp, n, tt, and 
dipole-bound states. The lowest six states correspond 
to a orbitals due to s-s hybridization, which have larger 
GqWq exchange, correlation, and total self-energy cor- 
rections than the other states. The following ten states 
at higher energy levels exhibit agp character, and their 
S^ and S^ show a linear but opposite dependence with 
respect to the GqWq QP energy 6^°^°. Thus, their sum 
S^^ is shown to be almost constant, ranging from —20.5 
to —19.4 eV. Since the same trend is present in 6^^, 
the final difference A^^ between GqWq and DFT results 
stays almost constant, between —3.3 and —3.0 eV. The 



next three n and six tt valence states and three tt* con- 
duction states have a similar behavior, despite different 
magnitudes in their self-energy corrections. In particular, 
the six TT valence states are lowered by about —2.5 eV, 
while the three tt* conduction states are lifted by 2.1 eV, 
leading to an increase of 4.6 eV for the HOMO-LUMO 
gap. The above observations provide an important ev- 
idence that the GqWq self-energy corrections are highly 
orbital-dependent and on average T^-^{ass) < ^'^{(isp) < 
S^(n) < S^(^), SC(a,,) > ^^{asp) > ^^(n) > ^^{tt), 
and A^^{ass) < ^^^{(Tsp) ^ A^^{n) < A^C(7r). Con- 
sequently, the commonly-used "scissor operator" to cor- 
rect bandgaps by rigidly lowering the valence levels and 
increasing the conduction levels by the same amount will 
never be adequate for describing the entire QP spectrum. 



VII. SUMMARY 

In summary, VIPs, VEAs, and DOS of five DNA and 
RNA bases obtained from a fully converged many-body 
GqWq approach are found to be in very good agree- 
ment with experiments and other theoretical works. Two 
types of vertical electron affinities are found, correspond- 
ing to localized valence-bound excitations and delocalized 
dipole-bound excitations. Our calculations further reveal 
that QP inverse lifetimes depend linearly on QP energies 
for the deep valence states. They, however, come from 
the zero-th order GqWq estimation, and may be signif- 
icantly affected in self-consistent GW calculations. In- 
terestingly, the GqWq self-energy corrections are highly 
orbital dependent, but remain relatively constant for the 
states with similar bonding character. Moreover, GqWq 
VIPs of lone pair states deviate from the experimental 
ones more than those for tt states. Whether this differ- 
ence comes from the different self-interaction errors in 
Kohn-Sham eigenstates will require further studies us- 
ing self-interaction corrected functionalP^^^ work is in 
progress along this direction. 
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